Dual-microphone Robust Front-end for Arm’s-length Speech Recognition
Abstract
This paper describes a novel method of improving the performance of a speech recognition front-end in non-stationary background noise. A two-microphone array has been designed that both enhances the speech and provides a continuous estimate of the background noise. This processing has been integrated with the standard ETSI DSR Advanced Front End so that the continuous noise estimate is an input to the first stage of Wiener filtering, increasing the noise suppression and hence the recognition performance. Tests with real-world noise have shown the recognition error rate is reduced by up to 50% when compared with single-microphone input to the same Advanced Front End.
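The role of a continuous noise estimate in Wiener filtering can be illustrated with a minimal sketch. This is not the ETSI Advanced Front End algorithm, nor the authors' array processing, neither of which the abstract specifies. It is a textbook per-bin Wiener gain, assuming a hypothetical second channel (`noise_frame`) that carries a noise reference for the same time frame. The function names, the Hann window, and the `floor` parameter are illustrative assumptions.

```python
import numpy as np

def wiener_gain(noisy_psd, noise_psd, floor=0.1):
    """Per-bin Wiener gain H = SNR / (1 + SNR).

    The SNR is estimated by power spectral subtraction; `floor`
    (an assumed parameter) limits the maximum attenuation.
    """
    noisy_psd = np.asarray(noisy_psd, dtype=float)
    noise_psd = np.asarray(noise_psd, dtype=float)
    # SNR estimate: (noisy - noise) / noise, clipped at zero.
    snr = np.maximum(noisy_psd - noise_psd, 0.0) / np.maximum(noise_psd, 1e-12)
    return np.maximum(snr / (1.0 + snr), floor)

def enhance_frame(speech_frame, noise_frame, floor=0.1):
    """Filter one frame of the speech channel.

    The noise PSD is taken from the simultaneous frame of the
    hypothetical noise-reference channel, so it is updated every
    frame rather than only during speech pauses.
    """
    window = np.hanning(len(speech_frame))
    speech_spec = np.fft.rfft(speech_frame * window)
    noise_spec = np.fft.rfft(noise_frame * window)
    gain = wiener_gain(np.abs(speech_spec) ** 2, np.abs(noise_spec) ** 2, floor)
    return np.fft.irfft(gain * speech_spec, n=len(speech_frame))
```

Because the noise spectrum is re-estimated from the reference channel on every frame, the gain can follow non-stationary noise, which a single-microphone estimate taken only during speech pauses cannot do as quickly.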
Related Papers
A Two-Channel Acoustic Front-End for Robust Automatic Speech Recognition in Noisy and Reverberant Environments
An acoustic front-end for robust automatic speech recognition in noisy and reverberant environments is proposed in this contribution. It comprises a blind source separation-based signal extraction scheme and only requires two microphone signals. The proposed front-end and its integration into the recognition system are analyzed and evaluated in noisy living room-like environments according to th...
Noise robust hands-free speech recognition using microphone array and Kalman filter as front-end system of conversational TV
In this paper, we investigate hands-free speech recognition as the front-end system of a conversational TV. A conversational TV is a machine conversation system that retrieves information of interest in response to spoken queries. To realize natural machine conversation without the user being conscious of the microphone, hands-free speech recognition is required. In the hands-free speech recognition system,...
Model-based independent component analysis for robust multi-microphone automatic speech recognition
In this communication, we present a method for noise-robust multimicrophone automatic speech recognition (ASR). It is assumed that the speech source to be recognized is recorded with several microphones in a noisy acoustic environment. The proposed method estimates the short-term subband energies (as they are needed for computing the ASR front-end) of the clean speech source from the ones of th...
Microphone array design for robust speech acquisition and recognition
The aim of this paper is to study the use of a robust acquisition system based on a microphone array for speech related applications in real situations. A comparison is performed between two beamforming methods: the Delay and Sum beamforming (DS) and the Spatial Reference Optimal beamforming (SRO). Both of them are frequency domain designed, using harmonic spatial distributed microphones. The q...
Smoothed Nonlinear Energy Operator-Based Amplitude Modulation Features for Robust Speech Recognition
In this paper we present a robust feature extractor that uses smoothed nonlinear energy operator (SNEO)-based amplitude modulation features for a large vocabulary continuous speech recognition (LVCSR) task. SNEO estimates the energy required to produce the AM-FM signal, and then the estimated energy is separated into its amplitude and frequency components using an energy separa...